Designing Good Semi-structured Databases
Abstract
Semi-structured data has become prevalent with the growth of the Internet and other on-line information repositories. Many organizational databases are presented on the web as semi-structured data. Designing a "good" semi-structured database is increasingly crucial to prevent data redundancy, inconsistency and updating anomalies. In this paper, we define a semi-structured schema graph and identify the various anomalies that may occur in the graph. A normal form for semi-structured schema graphs, S3-NF, is proposed. We present two approaches to designing S3-NF databases, namely, restructuring by decomposition and the ER approach. The first approach consists of a set of rules to decompose a semi-structured schema graph into S3-NF. The second approach uses the ER model to remove anomalies at the semantic level.
Similar resources
Designing Semistructured Databases: A Conceptual Approach
Semi-structured data has become prevalent with the growth of the Internet. The data is usually stored in a traditional database system or in a specialized repository. While many information providers have presented their databases on the web as semi-structured data, other information providers are developing repositories for new applications. One such application is e-commerce, which is emerging...
Standing on the Shoulders of Giants: digital platform designing with native XPath enabled storage and retrieval on MySQL
XML has become a standard for the exchange and retrieval of semistructured data between different platforms but is not currently a standard for storing semi-structured data in relational databases. There exist a number of different approaches in transforming or mapping semi-structured documents into the highly structured relational database schema. In this paper we do not present a new approach...
Hyperset approach to semi-structured databases and the experimental implementation of the query language Delta
This thesis presents practical suggestions towards the implementation of the hyperset approach to semi-structured databases and the associated query language ∆ (Delta). This work can be characterised as part of a top-down approach to semi-structured databases, from theory to practice. Over the last decade the rise of the World-Wide Web has led to the suggestion for a shift from structured rela...
On the Information Content of Semi-Structured Databases
In a semi-structured database there is no clear separation between the data and the schema, and the degree to which it is structured depends on the application. Semi-structured data is naturally modelled in terms of graphs which contain labels which give semantics to its underlying structure. Such databases subsume the modelling power of recent extensions of flat relational databases, to nested...
Efficient Frequent Pattern Mining Techniques of Semi Structured data: a Survey
Semi-structured data comprise large volumes of complex and heterogeneous data sets. Such models capture data that are not intentionally structured, but are structured heterogeneously. These databases evolve quickly; examples include run-time reports generated by ERPs, the World-Wide Web with its HTML pages, text files, bibliographies, and various generated logs. These huge and varied become difficult to retrieve r...